Intention extraction and semantic matching for internet FAQ retrieval using spoken language query

نویسندگان

  • Yu-Sheng Lai
  • Kuen-Lin Lee
  • Chung-Hsien Wu
چکیده

An FAQ (frequently-asked question) pattern consists of a question and a text document that answers the question and contains some additional remarks. As a query is similar to the FAQ’s question, the FAQ’s answer gives a possible answer or parts of the answer of the query. On the other hand, an FAQ’s answer may also contain information not concerning with the corresponding FAQ’s question but embed the answer for other questions. For a given query, therefore, the answer can be obtained from both FAQ question and answer. In this paper, we propose a framework for Internet FAQ retrieval by using spoken language query. We aim at two points: (1) extraction of the main intention embedded in a query sentence and (2) semantic comparison between a query sentence and an FAQ pattern. To evaluate the system performance, a collection of 1022 FAQ patterns and a set of 185 query sentences are collected for experiment. In intention extraction, 91.9% of intention segments can be extracted correctly. Compared to the keyword-based approach, an improvement from 78.06% to 95.28% in recall rate for the top 10 candidates is obtained.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Improved Skips for Faster Postings List Intersection

Information retrieval can be achieved through computerized processes by generating a list of relevant responses to a query. The document processor, matching function and query analyzer are the main components of an information retrieval system. Document retrieval system is fundamentally based on: Boolean, vector-space, probabilistic, and language models. In this paper, a new methodology for mat...

متن کامل

Improved Skips for Faster Postings List Intersection

Information retrieval can be achieved through computerized processes by generating a list of relevant responses to a query. The document processor, matching function and query analyzer are the main components of an information retrieval system. Document retrieval system is fundamentally based on: Boolean, vector-space, probabilistic, and language models. In this paper, a new methodology for mat...

متن کامل

Semiautomatic Image Retrieval Using the High Level Semantic Labels

Content-based image retrieval and text-based image retrieval are two fundamental approaches in the field of image retrieval. The challenges related to each of these approaches, guide the researchers to use combining approaches and semi-automatic retrieval using the user interaction in the retrieval cycle. Hence, in this paper, an image retrieval system is introduced that provided two kind of qu...

متن کامل

Latent Semantic Inference for Agriculture FAQ Retrieval

FAQ system can make user find answer to the problem that puzzles them. But now the research on Chinese FAQ system is still on the theoretical stage. This paper presents an approach to semantic inference for FAQ mining. To enhance the efficiency, a small pool of the candidate question-answering pairs retrieved from the system for the follow-up work according to the concept of the agriculture dom...

متن کامل

Knowledge-based information retrieval from semi-structured text

We are developing a class of systems, called FAQ Finder systems, that use a natural language questionbased interface to access distributed text information sources, speci cally text les organized as question/answer pairs such as FAQ les (Hammond, et al. 1995). In using these systems, a user enters a question in natural language and the system attempts to nd an information source that answers th...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2000